-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Low precision types support in Convert operation #5640
Conversation
9d08366
to
daebf97
Compare
@ilyachur @jane-intel @pelszkow guys I think we should introduce general support of LP types to ngraph. We should consider introduction of |
The LP types are introduced in order to minimize the size of weights. If we want to work with vector of |
I was thinking about specialization like with vector of bool which would allow us to keep the same memory usage. My main point is that we should have some common place for code handling LP types in ngraph, implementation details we can discuss when we agree that it's needed. If you want to think about it on concreate use case you can look at TestCase class and think how to properly introduce LP types handling there (at this point they are not handled). |
Yeah, I know about the vector of bool, but unfortunately we also have |
I am really bad in communicating today :) I think it is technically possible to implement our own specialization of vector, e.g. |
It looks like we should provide some tool which will be some mix of
|
Oh, ok, now I got your point. So in this case, I think we need to try to implement it. |
Ok, so just to be on the same page, I'll create separate ticket for this. Let's merge this PR as is, including your comments ofz. |
@ilyachur any chance you will review today? I'd love to merge it before FF :) |
@jdanieck I merged this PR, can we add more tests in the next PR? |
* Add initial version of u1 type support. * Turn off u8_to_u1 test in IE.CPU. * Fix compilation issue. * Replace std::memset with std::fill. * Add u4 type support. * Add i4 support. * LP types support generalized. * Remove std::copy optimization. * Fix backend test for LP types. * Fixed arm plugin compilation. * Add LP types to Serialization SLT. * Add Convert to summarize.py report.
* Add initial version of u1 type support. * Turn off u8_to_u1 test in IE.CPU. * Fix compilation issue. * Replace std::memset with std::fill. * Add u4 type support. * Add i4 support. * LP types support generalized. * Remove std::copy optimization. * Fix backend test for LP types. * Fixed arm plugin compilation. * Add LP types to Serialization SLT. * Add Convert to summarize.py report.
Details:
Tickets: